CpG_MI: a novel approach for identifying functional CpG islands in mammalian genomes

نویسندگان

  • Jianzhong Su
  • Yan Zhang
  • Jie Lv
  • Hongbo Liu
  • Xiaoyan Tang
  • Fang Wang
  • Yunfeng Qi
  • Yujia Feng
  • Xia Li
چکیده

CpG islands (CGIs) are CpG-rich regions compared to CpG-depleted bulk DNA of mammalian genomes and are generally regarded as the epigenetic regulatory regions in association with unmethylation, promoter activity and histone modifications. Accurate identification of CpG islands with epigenetic regulatory function in bulk genomes is of wide interest. Here, the common features of functional CGIs are identified using an average mutual information method to differentiate functional CGIs from the remaining CGIs. A new approach (CpG mutual information, CpG_MI) was further explored to identify functional CGIs based on the cumulative mutual information of physical distances between two neighboring CpGs. Compared to current approaches, CpG_MI achieved the highest prediction accuracy. This approach also identified new functional CGIs overlapping with gene promoter regions which were missed by other algorithms. Nearly all CGIs identified by CpG_MI overlapped with histone modification marks. CpG_MI could also be used to identify potential functional CGIs in other mammalian genomes, as the CpG dinucleotide contents and cumulative mutual information distributions are almost the same among six mammalian genomes in our analysis. It is a reliable quantitative tool for the identification of functional CGIs from bulk genomes and helps in understanding the relationships between genomic functional elements and epigenomic modifications.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Predicting CpG Islands and Their Relationship with Genomic Feature in Cattle by Hidden Markov Model Algorithm

Cattle supply an important source of nutrition for humans in the world. CpG islands (CGIs) are very important and useful, as they carry functionally relevant epigenetic loci for whole genome studies. As a matter of fact, there have been no formal analyses of CGIs at the DNA sequence level in cattle genomes and therefore this study was carried out to fill the gap. We used hidden markov model alg...

متن کامل

Comparative analysis using K-mer and K-flank patterns provides evidence for CpG island sequence evolution in mammalian genomes

CpG islands are GC-rich regions often located in the 5' end of genes and normally protected from cytosine methylation in mammals. The important role of CpG islands in gene transcription strongly suggests evolutionary conservation in the mammalian genome. However, as CpG dinucleotides are over-represented in CpG islands, comparative CpG island analysis using conventional sequence analysis techni...

متن کامل

CG dinucleotide clustering is a species-specific property of the genome

Cytosines at cytosine-guanine (CG) dinucleotides are the near-exclusive target of DNA methyltransferases in mammalian genomes. Spontaneous deamination of methylcytosine to thymine makes methylated cytosines unusually susceptible to mutation and consequent depletion. The loci where CG dinucleotides remain relatively enriched, presumably due to their unmethylated status during the germ cell cycle...

متن کامل

Quick identification and localization of CpG islands in large genomic fragments by partial digestion with Hpa II and Hha I.

More than 50% of mammalian genes are associated with CpG islands and thus they serve as a good gene marker. We have devised a simple method to scan large pieces of native or cloned genomic DNA for CpG islands. The method is based on the presence of multiple Hpa II and Hha I sites in CpG islands, at a frequency 30 times higher than in the rest of the genome. The steps include complete digestion ...

متن کامل

CpG islands as genomic footprints of promoters that are associated with replication origins

The primary target for DNA methylation in mammalian genomes is cytosine in the dinucleotide CpG. High densities of CpG dinucleotides are found in CpG islands, but paradoxically CpG islands are normally in a non-methylated state. Here, we speculate why CpG islands are immune to methylation and why they are so rich in guanine and cytosine relative to the surrounding DNA. We propose that CpG islan...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 38  شماره 

صفحات  -

تاریخ انتشار 2010